Analysis of the Difficulties in Chinese Deep Parsing

Authors

  • Kun Yu
  • Yusuke Miyao
  • Takuya Matsuzaki
  • Xiangli Wang
  • Jun'ichi Tsujii
Abstract

This paper discusses the difficulties in Chinese deep parsing, by comparing the accuracy of a Chinese HPSG parser to the accuracy of an English HPSG parser and the commonly used Chinese syntactic parsers. Analysis reveals that deep parsing for Chinese is more challenging than for English, due to the shortage of syntactic constraints of Chinese verbs, the widespread pro-drop, and the large distribution of ambiguous constructions. Moreover, the inherent ambiguities caused by verbal coordination and relative clauses make semantic analysis of Chinese more difficult than the syntactic analysis of Chinese.

Similar resources

Evaluation Report of the third Chinese Parsing Evaluation: CIPS-SIGHAN-ParsEval-2012

This paper gives an overview of the third Chinese parsing evaluation, CIPS-SIGHAN-ParsEval-2012, including its parsing sub-tasks, evaluation metrics, and training and test data. The detailed evaluation results and simple discussions will be given to show the difficulties in Chinese syntactic parsing.

Evaluation Report of the fourth Chinese Parsing Evaluation: CIPS-SIGHAN-ParsEval-2014

This paper gives an overview of the fourth Chinese parsing evaluation, CIPS-SIGHAN-ParsEval-2014, including its parsing, evaluation metrics, and training and test data. The detailed evaluation results and simple discussions will be given to show the difficulties in Chinese syntactic parsing.

An Algorithm Combining Statistics-based and Rules-based for Chunk Identification of Chinese Sentences

Natural language processing (NLP) is a very hot research domain. One important branch of it is sentence analysis, including Chinese sentence analysis. However, currently, no mature deep analysis theories and techniques are available. An alternative way is to perform shallow parsing on sentences which is very popular in the domain. The chunk identification is a fundamental task for shallow parsi...

Comparative Analysis of the Kurdish Problem in Turkey and the Issue of Chinese in Malaysia within the Context of Nation-State and Ethnic Differences: Advantages and Disadvantages in terms of Turkey

In this study, the ethnic problems which are one of the most significant and perhaps the primary structural difficulties and problems of the nation-state and possible solutions will be suggested. Even if it starts with the general information, the focus of this study would be the subject matter which is known as the Southeastern Problem in Turkey yet it has started to be mentioned as the Kurdis...

Grammatical Relations in Chinese: GB-Ground Extraction and Data-Driven Parsing

This paper is concerned with building linguistic resources and statistical parsers for deep grammatical relation (GR) analysis of Chinese texts. A set of linguistic rules is defined to explore implicit phrase structural information and thus build high-quality GR annotations that are represented as general directed dependency graphs. The reliability of this linguistically-motivated GR extraction...

Publication date: 2011